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Sir: 

Under the provisions of 35 U.S.C. § 119 and 37 C.F.R. 

§ 1.55(a), the applicant (s) hereby claim (s) the right of 
priority based on the following application (s) : 

Country Application No. Filed 

EUROPE 01401476.5 June 7, 2001 



A certified copy of the above-noted application (s) is (are) 
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credit any overpayment to Deposit Account No. 02-244 8 for any 
additional fee required under 37 C.F.R. §§ 1.16 or 1.17; 
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TRANSCRIPTIONAL REGULATOR OF GENES INVOLVED IN THE CONTROL 
OF CELL GROWTH OR CELL PROLIFERATION. USE OF SAID REGULATOR 
AS A THERAPEUTIC OR DIAGNOSTIC AGENT. 

The present invention relates to a transcriptional regulator of genes involved 
in control of cell growth and cell proliferation and relates to the use of the gene for 
the preparation of therapeutic or diagnostic means. 

The development of cells toward a malignant state is controlled in the 
human body by numerous parameters including the expression of specific genes, 
some of which are constitutively expressed in cells. Considerable effort has been 
directed in the past years towards the identification of the mechanisms of 
malignant transformation of cells particularly with the aim of identifying new 
therapeutic approaches and tools including products, in order to remedy to said 
transformation toward a malignant cell or in order to stop progression of such a 
state toward the formation of tumors and in some instances, in order to reverse the 
malignant state towards a normal state. 

In the field of cancer research, some genes have been identified in cells, the 
regulation of which is affected during the transformation of cells toward a 
malignant state. Such genes include tt anti-oncogenes " , some of which have been 
identified, cloned and sequenced. Among these genes, p53 and IRF1 (Interferon 
Regulatory Factor 1) have been well characterised and it has been shown that 
their maintenance in cellular growth control during genotoxic damage or aberrant 
proliferation is required in order to prevent the development of a malignant cell. 

Starting from the observation that said p53 and IRF1 genes are involved in 
preventing the transformation of cells toward a malignant state, the inventors have 
studied their regulation and have found ways to interact with the control functions 
controlled by these genes. 

The present invention offers means which can interfere with the 
transformation of cells toward a malignant state, or which can be used to control 
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the spreading of such cells including by enabling their apoptosis, said means being 
at least to some extent capable of overcoming some drawbacks and especially the 
insufficient level of activity of the p53 or IRF1 genes observed when these genes 
are used to design therapeutic protocols. 

-j= Ire — i n-t i on — i n ci — offe rs _ t^i e — p>o s s i t=» i l fty— to— c^o n tro I — tfn tra nscri p>ti o n— o^— s» — 

tn-oacl spectrum of genes having functions in the maintenance of normal growth or 
differenciation of cells. 

A publication (Lallemand C. et al, 1997) has disclosed the presence in the 
promoter regions of both the IRF1 and p53 anti-oncogenes of a sequence denoted 
"IRF1 p53 common sequence" (IPCS) which markedly increases the 
transcriptional activity of a reporter gene placed under the control of a 
heterologous promoter in transfected U937 cells. 

According to this publication, the IPCS sequence is defined as a 
degenerated motif replying to the sequence AAATGRYKKCMMS (IUAP code) 
which sequence has been shown according to said publication to bind a nuclear 
factor denoted IPCS-binding factor (IPCS-BF). Said IPCS-BF factor is composed 
of a single polypeptide of 26 kDa which is present constitutively in nuclear extracts 
of U937 cells and Peripheral Blood Mononuclear cells from healthy donors. 
According to the publication of Lallemand C. et al, it is believed that the 
transcriptional regulation of the p53 and IRF1 genes is influenced by the binding of 
said IPCS-BF factor, to the IPCS sequence. 

The inventors have now identified a novel transcription factor denoted 
GAAP-1 (Gatekeeper of Apoptosis Activating Proteins) which influences the 
transcription regulation of at least the p53 and IRF1 genes. According to the 
present invention, this GAAP^I factor binds specifically to the IPCS sequence 
found in the promoter regions of both p53 and IRF1 genes and corresponds to a 
protein of around 75 kDa which is an alternative splice product of a known gene, 
PRDIJ-BF1 (Fan CM. and Maniatis, T., 1990 ; MuchardtC. etal, 1992). 
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The PRDII-BF1 gene has been disclosed as a gene encoding a 300 kDa 
cellular protein with two widely spaced zinc fingers. It has also been presented as 
enabling the alternative production of spliced gene products encoding respectively 
proteins of 200 and 70 kDa. The spliced forms of the PRDII-BFI cDNA have been 
disclosed in the above-cited publication of Muchardt C. et al, showing that the 
short transcript is devoid of the nucleotide sequence of the PRDII-BF1 gene 
corresponding to exon IV. 

Accordingly, this short transcript gives rise to a cDNA sequence comprising 
as a coding sequence, the exon I, ll f III, V, VI, VII, VIII and IX of the PRDII-BF1 
gene and thus, lacks the sequence comprised between positions 416 and 6394 of 
the gene as disclosed in Muchardt et al 1992 with reference to Fan et al 1 990. 

The invention thus identified a novel transcription factor GAAP-1 interacting 
with a specific sequence present in the regulation region of several genes and 
therefore opens a new way for the definition of a means useful in a therapeutic, 
diagnostic or prognostic approach of various pathologies including those related to 
malignant cells. 

The invention thus relates to the use of a nucleotide sequence selected 
among: 

(i) a nucleotide sequence comprising the DNA sequence identified 
under No. 1 , and represented on Figure 14 

(ii) a nucleotide sequence encoding a polypeptide having the amino-acid 
sequence identified under No. 2, and represented on Figure 15 

(iii) a nucleotide sequence derived from sequence defined under (i) or (ii) 
wherein said sequence is modified particularly by deletion, addition or 
substitution of one or more nucleotides providing that the resulting 
nucleotide sequence encodes a polypeptide capable of binding a 
nucleotide sequence designated IPCS which comprises the DNA 
sequence AAATGNNNNC, wherein N means any nucleotide (G, A, C 
orT(U)) 
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for the expression in a determined eucaryotic cell, of a polypeptide capable of 
interacting with said nucleotide sequence designated IPCS and capable of acting 
as a positive transcriptional factor for the transcription of a nucleotide sequence 
placed under the control of said IPCS sequence and present in said eucaryotic 

The above nucleotide sequence encodes the transcription factor of the 
invention, in conditions enabling its interaction and especially its binding with the 
identified IPCS sequence. 

The IPCS sequence has been defined in the prior publication of Lallemand 
C et ai 1997, and appears as a consensus sequence of 6 nucleotide residues in a 
sequence of generally 10 nucleotides. The sequence is disclosed by reference to 
the IUPAC code and therefore the positions GNNNN can respectively correspond 
to nucleotide guanine and to any nucleotides (G, A, C or T(U)). 

Particular IPCS sequences have been found in the p53 and IRF1 genes, 
which have the following DNA sequence: AAAATGATTTCCAC (for the p53 gene) 
and GAAATGACGGCACG (for the IRF1 gene). This sequence has also been 
identified in other anti-oncogenes such as Rb, p21 (WAF1), p27, wt1, bax, TNF 
receptor and FAS genes. 

These IPCS sequences are described hereafter by reference to the gene 
containing the same. 
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Gene 


Pos. 


Sequence IPCS 


Ref. genebank 


Rb 


680 


aaatgtattc 


L11910 


P21 


1138 


aaatgtattc 


U24170 


P21 


1876 


aaatgaaaac 


U24170 


P21 


2031 


aaatggtgac 


U24170 


P21 


4314 


aaatgtgtcc 


U24170 


P27 


292 


aaatggcaac 


AB003688 


Wt1 


1828 


aaatgggctc 


X74840 


Bax 


679 


aaatggtgcc 


U17193 


Bax 


694 


aaatgaaggc 


U17193 


TNFr 


946 


aaatgaacac 


U53483 


FAS 


580 


aaatgtcaac 


U31968 



The DNA sequence identified under No.1 is the sequence presented in 
Figure 14 and designated GAAP-1. 

The aminoacid sequence identified under No2. is presented in Figure 15 
and designated GAAP-1 . 

The deletion, insertion or substitution in the above defined nucleotide 
sequences include point mutation or mutation of several nucleotides provided the 
properties of the encoded polypeptide in accordance with the present invention, 
are retained in the mutated sequence. According to a preferred embodiment such 
a mutated sequence retains the zinc finger domains of the native sequence. 

The invention relates especially to the use of a nucleotide sequence 
replying to the above definition, for the expression of a polypeptide acting as a 
positive transcriptional factor for a gene chosen among genes involved in the 
control of cellular growth, cellular proliferation, cellular differenciation or cellular 
apoptosis. The genes include the above cited antioncogenes. 

The expression "control" of the cellular growth, cellular proliferation, cellular 
differenciation or cellular apoptosis means within the invention, that the used 
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nucleotide sequence is capable of influencing, directly or indirectly; alone or as a 
result of a combined effect with other cellular or non cellular factors, qualitatively or 
quantitatively, the growth, proliferation, differenciation or apoptosis of cells. 

In a particular embodiment of the invention, the deletion of the nucleotide 
— sequenceof theinvention canleadtodifferenciationor to spontaneous apoptosis — 
or to induced apoptosis by a genotoxic drug. 

A possible assay to detect the influence of the use of the nucleotide 
sequence in this "control" is to compare the effect of the activity of the expression 
product of this nucleotide sequence on the expression of a determined gene, for 
example the p53 or the 1RF1 genes, in a determined cell, to the expression of the 
same gene in the same cell, in the absence of expression of the nucleotide 
sequence of the invention. 

According to a first preferred embodiment of the invention, the nucleotide 
sequence which is used comprises the cDNA corresponding to the 2083 
nucleotide (nt) coding sequence of the transcript of the PRDII-BF1 gene. 

According to another particular embodiment of the invention, the nucleotide 
sequence which is used is modified with respect to the 2.5kb sequence or to the 
coding sequence represented in Figure 14, especially by deletion of the 
nucleotides corresponding to exons I and II of the PRDII-BF1 gene and a start 
codon is added upstream from exon III of the PRDII-BF1 gene to enable the 
expression of a polypeptide in a eucaryotic cell. 

Thus in a particular embodiment of the invention, the nucleotide sequences 
which are used comprise the succession of exons III, V, VI, VII, VIII and IX of the 
PRDII-BF1 gene, said nucleotide sequence being devoid of exon IV of said PRDII- 
BF1 gene. * 

According to another particular embodiment of the invention, the nucleotide 
sequence which is used codes for a GAAP-1 polypeptide comprising the amino 
acid sequence identified under No. 2. 



Alternatively, the nucleotide sequence used codes for a variant of GAAP-1 
polypeptide, said variant being derived from GAAP-1 by insertion, deletion or 
substitution of one or several amino acid residues, provided it retains the property 
of GAAP-1 to bind an IPCS sequence and to act as a transcriptional factor for the 
for the transcription of a nucleotide sequence placed under the control of said 
IPCS sequence and present in a eucaryotic cell. 

The invention also encompasses the use of variants of said nucleotide 
sequence, especially shorter sequences, which are defined by their capacity to 
hybridise under stringent conditions with the DNA sequence identified under No. 1 . 
Stringent conditions according to the invention require an hybridisation 
temperature of 60°C and 3 washing steps in NaCI 40 mM SDS 0,5%. 

The hybridized transcripts are then identified and selected for instance by 
Northern blot, or electrophoretic migration. 

For the purpose of the invention, the nucleotide sequence which is used 
may be placed under the control of a promoter sequence selected among 
constitutive or inducible promoters in order to enable expression of a polypeptide. 
The selection of the regulation elements and especially of the promoter is 
dependant upon the final purpose of the expression sought. The promoters are 
those usually used by the skilled person in the art and may be for example 
selected among the following: CMV, 5V40, p-actin, eF1ct. 

A particularly preferred use of a nucleotide sequence according to the 
above definitions is characterised by the fact that the eucaryotic cells transformed 
by insertion of said nucleotide sequence are malignant cells. 

A "malignant cell" according to the invention is a cell whose growth, 
differenciation or proliferation if altered as a result of genotoxic damage, whatever 
their origin including as a result of ionising, radiation, genotoxic drug, viral 
oncogene etc... 

In another particular use, the eucaryotic cells transformed by insertion of 
one of the above nucleotide sequences are cells of a developed tumor. 
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In another preferred embodiment of the invention, the use of a nucleotide 
sequence according to the invention enables the control of cell apoptosis. 

The effect of controlling cell growth, cell differenciation, cefl proliferation or 
cell apoptosis can be achieved in accordance to the invention through the positive 
~~ regulatiorTof transcriptionarac^ 
(WAF1), p27, wt1, bax, TNF receptor and FAS, or advantageously of several 
genes present in the treated cells, thus possibly enabling an increased efficiency. 

As a result of the presence of the IPCS sequence in the promoter regions of 
several genes involved in the control of cell growth, cell differenciation, cell 
proliferation or cell apoptosis, the invention provides a way to broadly interact with 
this function of control of multiple genes, through a common pathway and with 
possibly one type of compound. The invention thus provides means which can be 
more efficient for the treatment of alteration of the above control. 

The invention also relates to a nucleotide sequence comprising the DNA 
sequence identified under No. 1 , said nucleotide sequence being devoid of the 
sequence forming exon IV in the PRDII-BF1 gene. 

In a particular embodiment of the invention, this nucleotide sequence 
consists of the DNA sequence identified under No. 1 (Figure 14). 

This DNA sequence can be denoted cDNA sequence since it may have 
been isolated from a cDNA library. By the expression "cDNA" the invention not 
only encompasses the full-length DNA sequence which is obtained by reverse 
transcription of the messenger RNA. It also encompasses DNA sequences which 
comprise or consist of the coding sequence corresponding to an Open Reading 
Frame of the transcript of the gene, or any coding sequence which is sufficient 
- when used in appropriate conditions, to enable the expression of a polypeptide 
having the properties defined in the present invention. 

The coding sequence of the invention may thus be placed under the control 
of an heterologous promoter for the expression of the polypeptide, and if 
appropriate under the control of additional or alternative regulatory sequences. 
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The invention therefore encompasses said coding sequence included in a 
recombinant nucleotide sequence comprising regulatory sequence for cloning 
and/or expression in a host cell or in a target cell. 

Particular nucleotide sequences derived from the above defined nucleotide 
sequences are those selected from the group consisting of: 

(i) a fragment of the DNA sequence identified under No. 1 (Figure 14), 
which can be used as a specific probe to detect the presence of said 
DNA sequence identified under No. 1 or a mutated sequence thereof, 

(ii) a nucleotide sequence encoding a polypeptide having the amino-acid 
sequence identified under No. 2, 

(iii) a nucleotide sequence derived from sequence defined under (i) or (ii) 
wherein said sequence is modified, especially by deletion, addition or 
substitution of one or more nucleotides providing that the resulting 
nucleotide sequence encodes a polypeptide capable of binding a 
nucleotide sequence designated IPCS which comprises the DNA 
sequence AAATGRYKKC, and is capable when used in appropriate 
conditions, of expressing in a determined eucaryotic cell, a 
polypeptide interacting with the nucleotide sequence designated 
IPCS and acting as a positive transcriptional factor for the 
transcription of a nucleotide sequence placed under the control of 
said IPCS sequence and present in said eucaryotic cell. 

A preferred nucleotide sequence is that of GAAP-1 contained on plasmid 
pGAAP-1 deposited at the ECACC (Great Britain) under n° 01052921 on May 29, 
2001. 

The invention however does not relate to the PRDII coding sequence, as 
such, being the sequence designated PRDII on figure 14. It does not relate also to 
the encoded sequence designated PRDII which appears on figure 15. 

PRDII coding sequence and amino acid sequence are different from GAAP- 
1 sequences at positions marked up in figures 14 and 15. 
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The invention also relates to a recombinant polypeptide, being the product 
of the expression in a eucaryotic cell, of a nucleotide sequence coding for a 
polypeptide capable of interacting with the nucleotide sequence designated IPCS 
and capable of acting as a positive transcriptional factor for the transcription of a 
-nucleotide_sequence-placed-under_the-ControLof-said-IRCS sequence-and-present - 
in said eucaryotic cell. 

In a preferred embodiment, the nucleotide sequence placed under the 
control of said IPCS sequence is a gene involved in the control of cell growth, cell 
differenciation, cell proliferation or cell apoptosis. 

According to another definition of the invention, the recombinant polypeptide 
of the invention is the product of the expression in a eucaryotic cell, of a nucleotide 
sequence as defined above. 

According to a particular embodiment of the invention, the recombinant 
polypeptide encompassed within the above definitions is further recognised by its 
capacity to regulate the transcriptional activity of a gene selected among p53 and 
IRF1 when it is expressed in a eucaryotic cell constitutively expressing said gene. 

In order to assay the capacity of such a polypeptide to achieve regulation of 
transcriptional activity of genes, either of pre-determined or of unknown genes, 
various cell lines can be used, including U937 (ATCC CRL 1593), K562 (ATCC CL 
243), SK-N-SH (HTB 11), MCF7 (HTB 22), or KG-1 (CL 246). These cells are 
transfected with a nucleotide sequence of the invention and their binding capacity 
to the IPCS sequence of the p53 or IRF1 genes expressed in the tested cell line is 
detected and subsequent transcriptional activity is assayed. 

Various methods are described in the art for the detection of the binding 
capacity of a compound acting as a trans-acting factor. Methods for identifying 
DNA-protein complexes are for example those involving analysing the product by 
gel retardation assays described in "Molecular Cloning - A laboratory Manual" 
(Sambrook and Russell, Cold Spring Harbor Laboratory Press, 2001). Methods for 
the analysis of transcripts of genes are also well known in the art. Such methods 
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include for example analysing primary transcripts by transcriptional run-on as 
described in "Molecular Cloning - A laboratory Manual" above cited. Other 
methods also described in "Molecular Cloning - A laboratory Manual" include 
having recourse to reporter genes as markers of the activity of regulatory 
elements, and as example using CAT (chloramphenicol acetyltransferase), 
luciferase, or (3-galactosidase genes, EGFP (Enhanced Green Fluorescent 
Protein). 

The recombinant polypeptide according to the invention can also be defined 
as a polypeptide having a theoritical molecular weight of 74,7 kDa. 

The molecular weight of GAAP-1 expressed in vivo has not been precisely 
determined since it is expressed as a fusion protein after transfection of Hela cells 
with a recombined nucleotide sequence GAAP-1/EGFP. The transfected cells 
have been fractionated by electrophoresis on SDS-PAGE gel and revealed after 
blotting by an antibody directed against EGFP. The molecular weight of the fusion 
protein is between 98 and 64 kDa. 

in a particular embodiment, the recombinant polypeptide, being the 
expression product in a eucaryotic cell is expressed in a mammalian cells, 
especially in a human cell, and is modified as a result of post translational 
modifications occurring in the eucaryotic cells, including proteolytic cleavage 
and/or phosphorylation, ubiquitinization, sumoylation. 

Phosphorylation can be detected on the expressed recombinant product, by 
using antibodies directed against the potentially phosphorylated amino acid 
residues, i.e. tyrosine, serine and threonine. 

A variant recombinant polypeptide according to the invention can be 
obtained as a result of the mutation of the phosphorylated amino acid residues, 
provided the mutated polypeptide retains the binding and transcriptional activity of 
the original one. 

A preferred recombinant polypeptide according to the invention comprises 
the amino acid sequence identified under No. 2 (Figure 15). 



12 



Another particular polypeptide according to the invention has an aminoacid 
sequence shorter and comprised within the the amino acid sequence identified 
under No. 2 or a variant thereof replying to the definitions given above, provided 
the obtained polypeptide is capable of interacting with the nucleotide sequence 
designated-IPGS-to act-as a-positive transcriptional factor-for-the transcription_of_a — 
nucleotide sequence placed under the control of said IPCS sequence and present 
in a eucaryotic cell. 

The invention further concerns recombinant eucaryotic cells which are 
recombined by insertion of a nucleotide sequence according to the definitions 

provided hereabove. 

In a particular embodiment, the recombinant eucaryotic cell expresses the 
polypeptide encoded by the inserted nucleotide sequence, which polypeptide has 
the above defined properties. 

The insertion of the nucleotide sequence of the invention can be achieved 
by transfection having recourse to known methods such as electroporation or other 
methods including those described in "Molecular Cloning - A Laboratory Manual" 
(Cold Spring Harbour Press - 2001). 

Particular recombinant eucaryotic cells of interest are selected among cells 
normally expressing genes involved in the control of cell growth, cell 
differenciation, cell proliferation or cell apoptosis. 

Such cells include, but are not limited to, cell lines used for the study of the 
properties of the expressed products, such as U937, K562, SK-N-SH, TF1, MCF7, 
or KG-1 cell lines. 

In a particular embodiment of the invention, especially in relation with the 
proposed use of the described means for a therapeutic, diagnostic or prognostic of 
alteration of the regulation of the transcription of genes involved in the control of 
cell growth, cell differenciation, cell proliferation or cell apoptosis, recombinant 
eucaryotic cells are cells obtained from a patient, are especially malignant cells or 
cells constituting a potential candidate for the occurrence of said alterations. 
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The recombinant eucaryotic cell according to the invention can also be 
defined as a cell which is recombined with a nucleotide sequence encoding a 
polypeptide capable of binding a nucleotide sequence designated IPCS and 
comprising the DNA sequence AAATGNNNNC, for the expression in a determined 
eucaryotic cell of a polypeptide capable of interacting with the nucleotide sequence 
designated IPCS and capable of acting as a positive transcriptional factor for the 
transcription of a nucleotide sequence placed under the control of said IPCS 
sequence and present in said eucaryotic cell. 

In a particular recombinant eucaryotic cell according to the invention, the 
polypeptide comprises the amino acid sequence identified under No. 2 (Figure 15). 

Particular recombinant eucaryotic cells useful for the invention are selected 
among stem cells or differenciated cells including U937, K562, SK-N-SH, MCF7, 
KGI,TF1. 

The invention also concerns recombinant polypeptides which are fusion 
polypeptides including those wherein the polypeptide of the invention is fused with 
a reporter sequence, for instance fused with EGFP or EGFP... FclgGL 

The invention is further directed to a recombinant vector which is 
recombined with an insert consisting of a nucleotide sequence of the invention. 

Particular recombinant vectors are expression vectors, suitable for 
expression of said insert, in a eucaryotic cell, especially in cells selected among 
the above cited cells. 

In a particular embodiment of the invention, the recombinant vector is 
suitable for transient expression or for controlled expression of said insert. 
Transient expression may be required for therapeutic purposes in order to avoid 
undesirable effects of the expression of the inserted construct in cells. Controlled 
expression includes the possibility of obtaining induced expression, or more 
generally the possibility to control the level of the expression of the inserted 
construct. 
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In order to obtain said controlled expression, said insert can be placed 

under the control of an inducible promoter, advantageously a promoter regulated 

by a physiologically acceptable compound. Inducible promoters are for example 

IPTG-inducible promoters such as the trp-lac (tac) promoter, the trp-lac (trc) 

-promoter,-the-lacpromoter,_which-are-used-in-pUC 1 -pSKpTZi-pGEMy~pBlueseritp- 
vectors. 

In another particular embodiment of the invention, in the recombinant vector 
the transcription of said insert is placed under the control of an exogenous 
transactivating system. Particular artificial systems for the control of the 
transcription encompass the tetracycline modulated system for example with the 
use of a pTRE vector (Clontech), the rapamycin inducible system Ponasteron 
Inducible System. 

The recombinant vector according to the invention is advantageously 
designed in such a way that it is suitable for gene therapy. 

For this purpose the recombinant vector can be selected among viral, 
retroviral, lentiviral, poxviral, adenoviral, AAV vectors. The known requirements 
and advantages for the design of safe and efficient vectors should be taken into 
consideration in preparing these vectors. A review of the techniques in this field is 
available in La Therapie genique (Editions TEC & DOC, 2001). 

The invention also concerns a composition suitable for therapeutic use, 
which comprises a nucleotide sequence as defined above, or a recombinant cell 
according to the invention, or a recombinant polypeptide of the invention. 

Such a composition may be used in a particular embodiment, in 
combination with a therapeutic agent selected among antiviral agents and 
anticancer agents. The combination means that the composition of the invention 
may be associated with the anticancer agent in the protocol. The administration of 
these associated compounds can be done under the form of a single composition, 
or alternatively, the composition of the invention and the anticancer agent can be 
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administered as separate compositions, including the possibility for a simultaneous 
concomitant or sequential administration in time. 

The anticancer agent can be an immuotherapeutic agent, a 
chemotherapeutic agent or can be radiotherapy. Appropriate agents include 5FU, 
cisplatine, etoposide, cytokines. 

The combined use propose may enable to reduce the level of the efficient 
doses of the anticancer agent used. 

The composition of the invention can be included in a therapeutic protocol 
for the treatment of malignant cells or for the treatment of tumors. 

The expression "treatment" within the present invention encompasses the 
effect produced on the control of cell growth, cell differenciation, cell proliferation, 
tumorigenicity, cell apoptosis which in order to inhibit, circumvent or even reverse 
the development of the malignant state, including the capacity of preventing or 
inhibiting the growth and/or spreading of a tumor. 

Cell differenciation can be assayed, for instance for erythroleucemic cells 
(K562, UT7) by their capacity to produce haemogloblin. 

Cell proliferation can be evaluated by measurement of the reduction of MTT 
or a similar assay. The cells can be injected to nude mice in order to determine 
their tumorigenic capacity. 

Apoptosis can be assayed by the "TUNEL M method or by labeling with 
annexin V. 

The invention thus relates to the use of a nucleotide sequence, or of a 
recombinant cell, or of a recombinant polypeptide, for the preparation of a 
biological tool for screening compounds capable of interacting with an IPCS 
sequence present in the promoter sequence of a gene involved in the control of 
cellular growth, cellular proliferation, cellular differenciation or cellular apoptosis, 
and capable of regulating the transcription of a nucleotide sequence placed under 
the control of said IPCS sequence. 
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The inventors have defined a novel approach for the regulation of genes 
involved in the control of the evolution of cells toward a malignant cells which 
approach offers means interacting with a potentially broad spectrum of target 
genes to control their regulation. 

— ^Fhe mvention-further-discloses-novel means useful-for " the detection of an- 

alteration of the control of the regulation pathway of the transcription of genes 
involved in the control of cell growth, cell differenciation, cell proliferation, cell 
apoptosis. The defined means are based on the detection of an abnormal BRDII- 
BFI gene in cells, either as a result of a mutation affecting the transcription of the 
gene normally producing the spliced form of messenger RNA corresponding to the 
nucleotide sequence of the invention, or affecting the proper translation of said 
transcript. The presence of an abnormal gene can be correlated to a potential 
deficient control of genes containing in their promoter region a sequence of the 
IPCS type and as a consequence, to the possibility for a cell to develop as a 
malignant cell or to the possibility for a malignant cell to be treated according to the 
invention. 

The invention therefore relates to a process for the in vitro detection of a 
deficient BRDII-BFI gene comprising the steps of: 

- contacting a probe constituted from the nucleotide sequence identified 
under No. 1 or a fragment thereof comprising the zinc finger binding 
domains corresponding to the domains localised within exon VI of the 
BRDII-gene, with the DNA of a cell normally constitutively expressing said 
gene, in stringent conditions, 

- detecting the hybridisation product between said probe and said cell DNA. 

- — One appropriate probe can be the nucleotide sequence spanning exon 5 to 
exon 9, i.e., nucleotides 6396 to 9020 of the sequence identified under No. 1 . 

The invention also provides a process for the in vitro detection of deficient 
transcriptional activity of genes involved in the control of cell growth, cell 
differenciation, cell proliferation or cell apoptosis, comprising the step of detecting 
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deficient production of the transcript of said gene which would normally encode a 
polypeptide capable of binding an IPCS sequence and as a result would positively 
regulate the transcription of a nucleotide sequence placed under the control of said 
IPCS sequence. 

According to another embodiment of the invention, a process is defined for 
the in vitro detection of a prognostic of transformation of cells toward a malignant 
state, which comprises the step of detecting a mutation in the PRDII-BF1 gene 
normally expressed in said cells, or detecting a mutation in the transcript obtained 
by splicing of said gene, which mutation would result in lack of expression or in an 
abnormal expression of polypeptide expression product of said PRDII-BF1 gene 
capable of binding to an IPCS sequence. 

The invention also concerns a process for the screening of compounds 
capable of regulating the transcriptional activity of genes containing an IPCS 
sequence in their promoter region, said genes being active in the control of cell 
growth, cell differenciation, cell proliferation, or cell apoptosis, said process 
comprising the steps of : 

- contacting the assayed compounds with the DNA of a cell expressing genes 
containing an IPCS sequence in their promoter region, 

- detecting a DNA-compound complex formation and assaying its 
transcriptional activity on said gene containing the IPCS sequence. 

The DNA of the cell can be used under the form of a nuclear extract of the 
cell. 

The invention also relates to the use of the described polypeptide for the 
screening of molecules, especially small molecules having interest for 
administration to patients, wherein said molecules are selected either for ther 
agonist properties with respect to GAAP-1 , or for their antagonist properties 
with respect to GAAP-1 . 

Agonists of GAAP-1 may be suitable for the preparation of anti-cancer 
agents. Antagonists of GAAP-1 may be suitable for the preparation of 
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compositions having interest in the treatment of inflammatory or 
autoimmune diseases. 

Numerous methods are available for the screening of such compounds 
especially for the detection of the DNA-compound formation. This step can be 
-carried— out_using-gel--retardation-assay,-supershift-assay T -Gompetition-assay— 
Techniques involving immunoblotting or immunoprecipitation assays may also be 
appropriate. 

The identification of compounds capable of inhibiting or stabilising the 
formation of the IPCS/GAAP-1 complex, can be carried out by a gel shift 
technique. 

The identification of compounds inhibiting or favoring the recruitment of 
other proteins capable of acting as partner with GAAP-1 can be carried out by co- 
immunoprecipitation. 

Identification of compounds capable of inducing translocation between 
cytoplasm and nucleus can make use of immunohistochemistry. 

Compounds capable of inducing postranslational modifications of GAAP-1 
such as phosphorylation, ubiqu ionization, sumoylation, acetylation can be 
identified by Western blot. 

The invention will be further illustrated by the following examples and 
figures. 

LEGEND OF THE FIGURES 

Figure 1 A Schematic representation of the reporter construct used to clone IPCS- 
BF1. The IPCS from p53 or IRF1 gene or the mutated IPCS is cloned upstream 
His3 minimum promoter. 

Figure 1B Confirmation of the IPCS-BF1 binding specificity. After the 
transformation by IPCS-BF1/Ga14AD (IPCS-BF1/Ga14) fusion protein or empty 
expression vector (control) the IPCS yeast reporter strain are spread on selective 
medium lacking histidine. As each reporter strain possess its own leaky expression 
of HIS3, the 3AT was adjusted at for IPCS-p53, IPCS-IRF1 and IPCSM-P53 at 15, 
25 and 40 mM respectively. 
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Figure 2A- Schematic representation of the exon composition of PRDII-BF1 and 
IPCS-BF1. This data are deduced from the work of gaynor (91) and Xu (Xu 99) 
The nucleotide positions are as in Fan & Maniatis (1990). 

Figure 3 - Deduced amino acid sequence of IPCS-BF1. The putative NLS, the 
two zinc fingers domain and the PEST domain are framed. 

Figure 4a) - Plasmid constructs encoding an EGFP/GAAP-1 fusion protein for 
transient expression. 

Figure 4b) - Sub-cellular localisation of EGFP/GAAP-1 variants. HuH7 cells were 
transiently transfected with the expression vector indicated. After 24h, the nuclei 
were stained with Hoechst 33342 and examined by confocal microscopy. 
Figure 5a) - EMSA using the IPCS-p53 probe and in vitro translated GAAP-1 
(lanel ), GAAP-1 mtZ1 (lane 3), GAAP-1 mt Z2 (lane 5), or reticulocyte lysat with 
the empty expression vector (lane 7). Even lanes: 50 fold molar excess of IPCS- 
p53 oligonucleotide was added to the reaction of the previous lane. The arrow 
indicates the specific complex. 

Figure 5b) The same as in B except that the IPCS-p53 probe was replaced by the 
NF-kB oligonucleotide. 

Figure 5c) Amino acid sequence of the two zinc finger domains. The residues 
which are substituted in GAAP-1 mtZ1 or in GAAP-1 mtZ2 are underlined. 
Figure 6 - Determination of post-translational modifications of GAAP-1. Western 
blot analysis of 20 ug of cellular extract from HeLa cells stably transfected with 
either an EGFP, EGFP/GAAP-1, GAAP-1/EGFP, EGFP/GAAP-1 del PEST 
expression vector, and probed with anti-EGFP antibodies. 

Figure 7 - Analysis of the effect of intramolecular cleavage on the DNA binding 
properties of GAAP-1, by EMSAs with nuclear extracts of K562 cell stably 
transfected with EGFP/GAAP-1 expression vector. 

Figure 8 - Relative activity of the wild type and mutated p53 and TRF1 promoters, 
IPCS-SV40 chimeric promoter, and the HIV-LTR in a co-transfection assay. U937 
cells were tansiently co-transfected with 3 ug of the reporter plasmids and 7 ug of 
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the expression vectors indicated /The pcDNA3/i vector was used as a control. The 
pRL-SV40 plasmid was used as an internal control for the normalisation of the 
transfection efficiency. The transfected cells were harvested 8h after transfection 
and assayed for lucif erase activity. 

Figure 9 — U-937-and-K562 cells-were stably-transfected-with~GAAP-1-or-EGFP - 
expression vectors and selected using G418 in semi-solid methyl-cellulose 
medium. The number of clones was determined at 21 days. 

Figure 10 - Involvement of GAAP-1 in apoptosis. Spontaneous apoptosis of U937 
cells stably transfected with EGFP or EGFP/GAAP-1 expression vectors was 
assayed by annexin V-PE/7-AAD double labelling. 

Figure 11 - Western blot analysis of 20 pg of cellular extract from U937 cells 
stably transfected with either an EGFP or EGFP/GAAP-1 expression vector, and 
probed with anti-p53 or anti-IRF1 antibodies. 

Figure 12 - U937 cells stably transfected with either the EGFP or EGFP/GAAP-1 
fusion protein expression vectors were treated or not with 1mM 5FU. Apoptosis 
was analysed by the TUNEL assay 72 hours after treatment. 

Figure 13 - Influence of expression of GAAP-1 on spontaneous or induced 
differentiation by A2ra-c or Hemin of K562 erythroid cells. 
Figures 14 and 15- GAAP-1 and PRDII sequences 
Figure 16 - Map of plasmid pGAAPI (ECACC 01052921) 

EXAMPLES 

The constitutive expression of the tumour suppressor genes p53 and IRF1 
(interferon regulatory factor 1) is required for the maintenance of the cellular 
growth control during genotoxic damage or aberrant proliferation. Following DNA 
damage, the activity of p53 is enhanced and its level rises considerably, mainly by 
protein stabilisation. p53 exerts its effect by transcriptional induction of genes 
involved in cell growth arrest which allows DNA repair or eventually apoptosis 
(Vogelstein 00 ) Mutations in the coding region of the p53 gene, which have been 
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observed in more than 50% of human cancers (Soussi 00), inactivation of the 
protein by the association with the viral oncoproteins E1A or E6, or deregulation of 
factors implicated in its turnover are the most frequent causes for an altered 
response to DNA damage (Vogelstein 00). However, the absent or weak 
expression of the p53 gene has also been observed in various tumoural cell lines 
(Raman 00) where this gene is apparently fully functional. This suggests that 
transcriptional deregulation of the p53 gene can be considered among the 
mechanisms involved in the loss of p53 function. 

The transcription factor IRF1 has been originally identified as a positive 
regulator of the interferon system (Miyamoto Fujita). Accumulating evidence also 
suggests that IRF-1 controls tumour susceptibility. Thus, the transformed 
phenotypes of c-myc or fos-B expressing cells can be suppressed by ectopic 
expression of IRF-1 (Tanaka 94). In addition, while c-HA-ras is unable to transform 
primary mouse fibroblasts (MEFs), it does transform fibroblasts originating from 
knockout mice for IRF1 (Tanaka 94a). IRF1 is necessary for genotoxic stress cell 
cycle arrest (Tanaka 96) and is responsible for the activation of at least two genes, 
p21waf1 and p27kip1, which play a central role in cell cycle control (Tanaka 1996 
Coccia99, Moro 00 ). Furthermore, IRF1, like p53, is involved in the induction of 
the apoptotic genes caspases (Furukawa 99) and Bax. IRF-1 is also implicated in 
the FAS apoptotic pathway. Indeed, the FasL promoter possess a specific binding 
site for IRF-1 which directly controls the inducibility of FasL by TCR (Wa 00). 
Alterations of the IRF-1 gene are also associated with various diseases. 
Chromosomal deletion or inactivation of one of the two alleles of the IRF1 gene 
are commonly associated with human leukemic or preleukemic syndromes 
(Taniguchi 1995). Loss of functional IRF1 mRNA by skipping specific exons has 
been reported in 20% of patients with myelodysplastic syndrome (Harada 94). 
Finally, loss of heterozigosity at the IRF1 locus is frequently observed in patients 
with gastric or esophagic cancers (Ogasawara 96 Tamura 96). 
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The expression of the p53 gene is inducible by a number of different stimuli. 
Several different regulatory elements are present within the p53 promoter, such as 
a genotoxic stress response sequence, extending from positions -70 to -40 (24), a 
binding site for c-Myc-Max heterodimers (25), two overlapping binding sites for 
-NF-1-and-Y¥1-from-positions -227-to -194,-implieated-in-the constitutive expression 
of the p53 gene, (27) and a site for HOXA5 at position -204/-201 . HOXA5 is a 
positive regulator of p53 transcription in the normal breast epithelium and lack of 
its expression may be the primary cause for loss of p53 expression in human 
breast cancer cells (Raman 00). 

The expression of the IRF1 gene is strongly induced by both IFN-alpha/beta 
and IFN-gamma. This property is due to the presence of a regulatory element 
called IR (inverse repeat) within the IRF1 promoter (28), which also contains 
several putative Sp1 sites, probably involved in the basal transcriptional activity of 
this gene (21, 28). 

The p53 and IRF1 promoters also contain NF-kB sites (28, 30), which 
confer upon each gene inducibility by a variety of different agents such as 
oxidative stress, IL-1p, and TNF-a. Other common features of the two genes are 
the constitutive and- ubiquitous expression in the quasi-totality of untransformed 
tissues (22) and the absence of TATA boxes (28, 31 , 32). 

The present invention describes the cloning of the GAAP-1 cDNA and 
presents data on the structure and function of this factor. The structure of GAAP-1 
has been correlated to some of its features such as sub-cellular localisation, DNA 
binding and transactivation activity. 

Experimental Procedure 

Cloning of GAAP-1 

S. cerevisiae YM4271 and the reporter vector pHISi were obtained from Clontech. 
The reporter constructs were generated by inserting one copy of the double 
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stranded oligonucleotides 5' AAAATGATTTCCAC3' , *-AAAACGATTTCCAC-3\ 5'- 
GAAATGACGGCACG3' , corresponding respectively to the IPCS-p53, IPCS-p53M 
and IPCS-IRF1 sequences previously identified (Lallemand 97), into the EcoRI and 
Xbal sites of the pHISi. These plasmids were linearised and used to transform 
YM2471 competent cells. The yeast reporter strains were maintained by selection 
on synthetic dextrose medium lacking histidine. As each yeast reporter strain 
possess its own leaky expression of HIS3, 20mM, 30mM and 45mM of 3-AT (3- 
amino-1,2 f 4-triazole) were added to the medium to suppress the background 
growth of respectively IPCS-p53, IPCS-IRF1 and IPCS-p53m yeast reporter 
strains. Screening of the human leukocyte cDNA library (Clontech) encoding 
proteins fused with the GAL4 activation was performed by the lithium acetate 
method (Gietz 92) in a yeast strain carrying the HIS3 reporter gene under the 
control of one IPCS-p53 sequence. The transformed yeast cells were plated under 
selective conditions in synthetic medium containing 20mM 3AT and lacking 
histidine and leucine. In order to confirm the transactivation properties of positive 
clones, plasmids were recovered from these clones by a rapid isolation procedure 
(Kaiser & Auer 1993), and used to transform E. coli. A mini-preparation (Wizard, 
Promega) of these plasmids were then used to transform yeast strains carrying the 
HIS3 reporter gene under the control of the IPCS-p53, IPCS-IRF1 or IPCS-p53M 
sequences. Only plasmids which can confer auxotrophy to histidine to strains 
containing IPCS-p53/HIS3 and IPCS-IRF1/HIS3 and non IPCS-p53M were 
retained. 

Cell lines and drugs 

U937 cells (ATCC CRL 1593), derived from a human histiocytic lymphoma and 
K562 (ATCC CL 243), derived from a human myeloid leukemia were cultivated in 
RPMI 1640 medium supplemented with 10% fetal calf serum (Life Technologies, 
Inc.) Hela cells (ATCC CCL-2) were cultivated in DMEM medium supplemented 
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with 10% fetal calf serum. 5 fluorouracil, cytosine 0-D-Arabinofuranoside and 
hemin were purchased from Sigma-Aldrich. 

Plasmids 

_jphe-GAAP-1-expression-veetor-was-eonstrueted-by-cloning-the-PeR-product(nt 
6396 to 8601) from human leukocyte cDNA library (Clontech Laboratories) in the 
pcDNA3.1 expression vector (Invitrogen) using the following primers: 5'- 
AGCATGGCATTAGGTAATCAAAAGTCCACAG-3' 

5'CCATCAGGTTGCTATCACAAGC-3' . The plasmid expressing the EGFP/GAAP- 
1 fusion protein was obtained by fusion PCR (Higuchi, R., In PGR protocols, a 
guide to methods and applications (Eds Innis, M.A., Gelfand,,) using the following 
primer couples (5*-ACCATGGTGAGCAAGGGCGA-3' , 5'- 

GATTACCTAATGCTCTCTTGTACAGCTCG-3') and (5*- 

GGACGAGCTGTACAAGATGGGGCAGAAGTTTCAAAA-3' , 5' 
CCATCAGGTTGCTATCACAAGC -3') to amplify respectively the EGFP and 
GAAP-1 coding region. The EGFP/GAAP-1 del NLS, the GAAP-1 mtZ1 and the 
GAAP-1 mtZ2 were obtained by overlapping PCR (Higuchi above) using the 
following overlapping primer couples (5'- 
CCTTAATCAAAAGTGAAGATGGAGGATATAAGTCA-3' , 5'- 
ATCCTCCATCTTCACTTTTGATTAAGGAATT-3'), (5*- 
GCAGAAGAAGCTGGAATACGTTGTAAGAAAC-3' , 5' - 
AGCTTCTTCTGCAATGTATTTTCCTCTTCC-3' ) and (5'- 
GCAATGAAGTCCAAGGCAGGAAGCAAGAAATGTGTGGA-3', 5' 
TCCTGCCTTG G ACTTCATTGCTTTTGTCAG ATTTCC-3' ) respecitvely. All the 
coding regions were cloned in the expression ' vector pcDNA3.lV (Invitrogen). The 
integrity of the constructs was verified by sequencng. The pRL-SV40 was 
purchased from Promega (Promega). 



Transfection 
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Transfection was performed with 10 |jg DNA per 10 6 cells / 0.8 ml medium by 
electroporation as described in Lallemand 97. Cells were transfected with DNA 
using Superfect (Qiagen) according the manufacturer's instructions. The pRL- 
SV40 (Promega) was used as an internal control for the normalization of the 
transfection efficiency. The transfected cells were harvested 24h after transfection 
and assayed for luciferase activity using the Dual-luciferase Reporter assay 
System (Promega). 

Western Blot Analysis 

Ten micrograms of nuclear extract, prepared as described below, were denatured 
by boiling for 5min in loading buffer (100nM Tris, 12,1% glycerol, 2.4% SDS, 1.4% 
b-mercaptoethanol, 2.3% bromophenol Blue), then fractionated in 12% SDS- 
PAGE and transferred onto PVDF membrane filter. The immunodetection was 
carried out as described elsewhere (ref) with anti-GFP (Zymed), anti-p53 (pAb 
240) (Santa Cruz Biotechnology Inc.) or anti-IRF1 (Santa Cruz Biotechnology Inc.) 
antibodies. Signals were detected with an ECL kit (Amersham). 

Nuclear extracts 

Nuclear extracts were prepared using a modification of the procedure described by 
Osbom et al. (34). Briefly, 10 7 cells were washed twice with phosphate-buffered 
saline, lysed with 20 pi of a buffer containing 10 mM HEPES, pH 7.9, 0.1% 
Nonidet P-40 (Fluka), 1.5 mM MgCI2, 10 mM KCI, 0.5 mMDTT, and the following 
protease inhibitors: 1 mM phenylmethylsulfonylfluoride, 50 MQ/ml [alpha ]- 
phenylmethylsulfonyl fluoride, and 5 pg/ml each of leupeptin, pepstatin, aprotinin, 
and antipain). Samples were incubated for 15 min on ice, and the cellular lysate 
was vortexed briefly and centrifuged in a microcentrifuge for 10 min at 4 °C. 
Nuclear pellets were suspended in 15 p' of extraction buffer (20 mM HEPES, pH 
7.9, 25% glycerol, 420 mM NaCI, 1.5 mM MgCI2, 0.2 mM EDTA, 0.5 mM DTT, 
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together with the same protease inhibitors as described above), and incubated for 
15 min at 4 °C, mixed briefly, and centrifuged in a microcentrifuge for 10 min at 4 
°C. The supernatant was then added to an equal volume of storage buffer (20 mM 
HEPES, pH 7.9, 20% glycerol, 0.2 mM EDTA, 0.5 mM DTT, together with the 
same protease inhibitors as ^e^ib^"above)"ari^"theTT storedlit"-70" 0 CT 

EMSA 

Synthetic double stranded oligonucleotide probes were labelled with a- 32 P dCTP 
(Amersham Corp.) by "filling-in" with Sequenase, and then purified on Sephadex 
G-50. The protein-DNA binding reactions were carried out using 5 ug of nuclear 
extract and 0.1 fmol of probe in 20 ul of binding buffer (20 mM Tris-HCI, 25 mM 
KCI, 1 mM DTT, 1 mM EDTA, 0.1% Nonidet P-40,4% glycerol, 1 mg/ml bovine 
serum albumin, ug/ml poly(dl-dC)) for 20 min at 4 "C. In vitro translation products 
were generated by 50ul of TNT coupled reticulocyte Lysate System (Promega) and 
1ug of the appropriate plasmid. 5ul of the in vitro product was used instead of the 
nuclear extract when required. The reaction mixture was then electrophoresed on 
a 6% non denaturing acrylamide gel. Competition experiments were carried out 
using a 50-fold molar excess of the unlabelled . probe. The following 
oligonucleotides (and their complementary strands) were used in these studies: 
IPCS-IRF1 , 5'-AGCCTGATTTCCCCGAAATGACGGCACGCAGCC-3*;IPCS-p53, 
5'-AATGCAGGATTCCTCCAAAATGATTTCCAC-3' ; NF-kB-MHC 5'- 

GATCCTCTGGGGATTCCCCATGG A-3' . 

Detection of apoptosis 

Spectrophotometric assay for the hemoglobin levels 

The hemoglobin levels were determined by a slight modification of a previously 
described method (Ray 96). 5x1 0 5 cells were washed twice with PBS, 
resuspended in 35ul of passive lysis buffer (Promega). The cell lysate was 
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centrifuged at 14000rpm in a microfuge for 15 min a 4°C. 30ul of supernatant was 
mixed with 100ul of freshly prepared benzidine-HCI (10mg/ml in 0.5% acetic acid) 
and 1ul of 30% H2O2 in a 96 wells microtiter plate. After 90s the absorbance at 
604nm was measured in a microtiter plate reader (ref). 

RESULTS 

Cloning and characterisation of the GAAP-1 cDNA. 

A common sequence exists in the p53 and IRF1 promoters (IPCS) which 
has been reported previously to bind a constitutive factor present in nuclear 
extracts of U937 cells and to direct the transcriptional activation of a reporter gene. 
This constitutive factor has been identified as a 26 kDa protein consisting in a 
single polypeptide chain (Lallemand 97). 

To identify complementary DNAs encoding proteins able to interact with the 
IPCS sequence we used a yeast one-hybrid system. A yeast tester strain, YIPCS, 
was established by introducing the reporter plasmid, pHISi-IPCS, into strain 
YM4271 (Clontech) (HisUraLeu). The pHISi-IPCS contained 14-bp of IPCS from 
the p53 promoter (IPCS-P53) corresponding to the sequence AAAATGATTCCAC, 
upstream of the minimal promoter of the HIS3 gene of the pHISI vector (Clontech 
laboratories). Strain YIPCS was transformed by a human leukocyte cDNA library 
(Clontech), in which cDNAs were fused with the GAL4 activation domain, and 
plated on medium lacking leucine and histidine and containing 20 mM 3-AT. From 

10^ transformants in this manner, we isolated a 2.5 kb cDNA clone coding for a 
protein fused with the GAL4 activation domain. To control the specificity of the 
interaction of this protein with IPCS, we transformed yeast cell lines harbouring a 
HIS3 reporter gene under the control of different IPCS sequences (fig. 1 A) with the 
2.5 kb cDNA clone. The hybrid protein was able to up-regulate the HIS3 gene 
controlled by the wild type IPCS from p53 and IRF1 genes, but not the one 
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controlled by a mutated IPCS (Fig 1B). This result suggests that the protein coded 
by the 2.5 kb cDNA clone possess binding properties for the IPCS sequence from 

\ the p53 promoter (nucleotides 6396 to 6488). 

Sequence analysis of the 2.5 kb cDNA, called GAAP-1 cDNA, revealed 95% 

— homology-to-the-human-PRDII-BF1-cDNA-(Fan-90 etcr) in a region comprised 

between nt 6488 and nt 9020 (relative to the ATG of PRDII-BF1 in Fan 90) (fig. 
2A). It was previously demonstrated that a 2.5 kb mRIMA could be produced from 
the PRDII-BF1 gene by skipping exon 4 f thus it contained exon 3 directly fused to 
exon 5 (Gaynor 91). Except for the first 90 bp, which are not encompassed by the 
GAAP-1 cDNA and are substituted by the sequence coding for the GAL4 activation 
domain in the clone obtained from the cDNA library, GAAP-1 cDNA corresponds 
exactly to the transcript of 2.5 kb described by Gaynor et al. 

Examination of the amino acid sequence encoded by the GAAP-1 cDNA 
revealed the presence of three major motifs (fig 3) : a putative NLS (PRRIKIF) 
distinct from the putative NLS previously identified (Fan 90 Baldwin), two C2-H2- 
type Zinc fingers previously characterised (Fan 90, Gaynor 91, Baldwin 90), and 
an acidic-amino acid rich domain similar to a PEST sequence (Rechsteiner 96) 
whose function is still unknown. 

Sub-cellular localisation of GAAP-1 

As GAAP-1 cDNA did not encompass the NLS already identified in PRDII- 
BF1 (9 kb) at 4158, Baldwin 90), we tested the activity of the NLS present in 
GAAP-1. 

In order to determine the sub-cellular localisation of GAAP-1, we performed 
transient transfection assays with various plasmid constructs encoding an EGFP- 
GAAP-1 fusion protein (fig 4A). To facilitate determination of the localisation of 
fluorescence, we used the human hepatoma mono-layer cell line HuH7 (Bressac 
90). As shown in fig 4B., the fluorescence of the wild type EGFP-GAAP-1 was 
localised both in the nucleus and cytoplasm. This distribution suggests that the 
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activity of GAAP-1 may be regulated by the control of its compartmentalisation. 
The fluorescence of the NLS deleted mutant was restricted to the cytoplasm, 
demonstrating that the PRRIKIF motif was indeed required for the import into the 
nucleus. More surprisingly, transfection of the fusion protein deleted of the PEST- 
like sequence produced a strictly nuclear fluorescence. Similar results have also 
been obtained in the diploid human fibroblastic cell line MRC5 (Maziere 00) (data 
not shown). All these data suggest that GAAP-1 possess a least two domains 
implicated in its translocation: an NLS, which is distinct from the previously defined 
NLS of PRDII-BF1, and a PEST-like sequence which may allow the export of the 
protein from the nucleus, in order to be degraded into the cytoplasm. 



DNA binding activity of GAAP-1 

PRDII-BF1 was previously described as a protein binding to an NF-kB site whose 
ability to recognise the DNA is determined by two pairs of zinc fingers, one of them 
present in GAAP-1 (Fan 90, Gaynor, Baldwin). However, the cloning of GAAP-1 
was realised using a sequence unrelated to the NF-kB sequence, suggesting that 
this factor was able to bind to at least two non homologous DNA sequences. To 
confirm the binding specificity of the cloned GAAP-1 for the IPCS target sequence, 
in vitro translated GAAP-1 was incubated with labelled IPCS or NF-kB 
oligonucleotide and analysed by Electro Mobility Shift Assay (EMSA). As shown in 
fig. 5A, GAAP-1 was able to form one complex with IPCS (Fig. 5A, lane 1) and two 
complexes with the NF-kB oligonucleotide (Fig. 5, lane 1), demonstrating the 
double specificity of GAAP-1 binding. Furthermore, the affinity of in vitro translated 
GAAP-1 was considerably stronger for the NF-kB probe than for the IPCS-P53 
probe. The formation of the slower migrating complex observed on the NF-kB 
sequence (fig 5B, lane 1) suggested an homodimerisation of GAAP-1. As GAAP-1 
possess two zinc fingers, we hypothesised that each finger was responsible for the 
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binding of each type of sequence. We constructed two GAAP-1 mutants, each 
containing either of the zinc fingers, the other being destroyed by substitution of 
the cysteine or histidine, necessary for the tertiary structure of a finger, by an 
arginine (fig 5C). In EMSA, the two in vitro translated mutants could not form any 

complex-with-either-IPGS-p53-(Fig-5A,-lanes-3 ? 5)-)-or-NP-KB-(Eig-5B,_lane 3,5) 

probe. Thus, both zinc fingers were required for the binding of GAAP-1 to both 
IPCS and NF-kB sequences. 

The efficiency and the specificity of GAAP-1 binding to DNA may be under 
the control of a post-transcriptional mechanism. In order to clarify whether GAAP-1 
was subject in vivo to post-translational modifications, we performed a Western 
blot analysis with nuclear extracts of K562 cells stably transfected with various 
GAAP-1 /EGFP fusion protein expression vectors (fig 4A) and an anti-GFP 
antibody. The choice of K562 cells was justified by the fact that this cell line can 
harbour a strong constitutive expression of GAAP-1 (see below). As shown in Fig 
6, although the in vitro translated GAAP-1 /EGFP migrated at its theoretical 
molecular weight of 102 kD, GAAP-1 /EGFP and EGFP/GAAP-1 expressed in 
K562 cells migrated respectively at about 35 kD and 95 kD (Fig 6, lanes 2 and 4). 
As the anti-EGFP antibody detected only the N-or C-terminal portions of these 
fusion proteins, we can deduce that in K562 cells, GAAP-1 was cleaved at 
approximately 100 residues from its C-terminus. In addition, this proteolysis was 
independent of the PEST sequence because the GAAP-1 /EGFP deleted of this 
sequence migrated at the same molecular weight as the wild type protein (fig. 6, 
lane 3). 

To analyse the effect of the intramolecular cleavege on the DNA binding 
properties of GAAP-1 , we performed EMSAs with nuclear extracts of K562 cells 
stably transfected with the EGFP/GAAP-1 expression vector. Indeed, in the in vivo 
produced EGFP/GAAP-1 the EGFP was not released from the DNA binding part of 
GAAP-1, in contrast to GAAP-1 /EGFP, so the fusion protein could be distinguished 
from endogenous GAAP-1. As shown in fig. 7, nuclear extracts of EGFP 
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transfected cells formed two complexes with the IPCS probe, containing probably 
the endogenous GAAP-1 (lane 1). Nuclear extracts of EGFP/GAAP-1 vector 
transfected cells gave rise to a similar pattern of migration with increased intensity 
mainly of band A (fig. 7, lanes 6-9). Thus, the addition of the EGFP tag to GAAP-1 
did not seem to alter the electrophoretic mobility of the DNA-protein complexes. 
Competition with IPCS p53 or IPCS IRF1 oligonucleotides showed that the 
interaction was specific (fig. 7, lane 7 and 8), and competition with the NF-kB 
oligonucleotide showed that in vivo produced EGFP/GAAP-1 possessed a double 
specificity of binding (fig 7, lane 9). The formation of complex A was inhibited by an 
anti-GFP antibody proving that it contained EGFP/GAAP-1 (lane 10). The residual 
complex was probably formed by endogenous GAAP-1. Indeed, the addition of 
anti-EGFP antibody did not affect the formation of the complex A containing only 
the endogenous GAAP-1 (lane 5). The faster migrating complex B (lane 6) was 
also inhibited by anti-EGFP antibody, but showed a specificity lower than that of 
complex A (lanes 7 and 8). A control EMSA performed with the in vitro translated 
EGFP/GAAP-1 and either the IPCS-p53 or NF-kB-MHC probe showed that the 
anti-EGFP antibody did not form super-shifted complexes, but rather inhibited the 
DNA binding activity of EGFP/GAAP-1 (data not shown). Fig. 7 presents also the 
results of an EMSA performed with the NF-kB probe of MHC. This experiment 
demonstrated that the DNA binding properties of GAAP-1 (lane 11-14) or the 
fusion protein EGFP/GAAP-1 (lane 16-19) were qualitatively similar on both IPCS 
and NF-kB probes, although an higher efficiency of binding was detected on the 
IPCS probe in contrast to the in vitro result. 

All these data strongly suggest that in K562 cells GAAP-1 is post 
translationally modified by at least a proteolytic cleavage near the C-terminus. 



32 



Trancriptional activity of GAAP-1 

To investigate the contribution of GAAP-1 to the transcriptional activity of 
p53 and IRF1 promoters, we performed co-transfection analysis in K562 cells. We 
used plasmids containing the luciferase reporter gene under the control of p53 
— (p53Huc)-or-|RF=1-(IRF1-ta 
IPCS as described previously (Lallemand 97). Alternatively, we used chimeric 
promoters consisting of an IPCS minimum binding site, or a mutated version of this 
site, cloned upstream of the SV40 promoter. Co-transfection of the GAAP-1 
expression vector increased the p53-luc, IRF1-luc or IPCS-SV40-luc reporter 
activity approximately 2,5 times compared to a similar experiment performed using 
the same reporter vectors with an empty expression vector (fig 8). In contrast, 
luciferase activity expressed by constructs containing the mutated IPCS sequence 
was not modified by co-transfection with the GAAP-1 expression vector and was 
comparable to the activity obtained by co-transfection of wild type reporter vectors 
and an empty expression vector. These data demonstrated that GAAP-1 
possessed a positive transcriptional activity. As IRF1 and p53 promoters contain 
several functional and putative NF-kB sites (Sims 96 Wu 94), which remained 
unchanged in the IPCS mutated promoters constructs, our results demonstrated 
that although GAAP-1 could bind the NF-kB sequence, this sequence was not 
implicated in the transcriptional regulation of IRF1 and p53 by GAAP-1. Lastly, we 
performed co-transfection experiments using an HIV-LTR/luciferase reporter 
construct which contains NF-kB binding sites. The failure of GAAP-1 to increase 
the gene reporter activity of this construct confirmed that GAAP-1 was 
trancriptionally active on an IPCS and not on an NF-kB sequence. This 
observation is in accordance with the data of Gaynor et ai. which showed that the 
product of the 2.5 kb mRNA obtained from the differentially spliced PRDII-BF1 
mRNA precursor (identical to GAAP-1) was not able to transactivate an LTR-HIV 
reporter construct. On the contrary, a similar experiment performed using a PRDII- 
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BF1 expression vector resulted in an increase in LTR-HIV or NF-kB reporter 
construct activity (data not shown). 

Involvement of GAAP-1 in apoptosis 

The transcriptional activation of IRF-1 and p53 promoters by GAAP-1 
suggested that this factor may possess growth-suppressive and/or apoptotic 
properties. To test this possibility, U937 and K562 cell lines, which express both 
IRF-1 and p53, and neither gene respectively, were transfected with GAAP-1 or 
EGFP/GAAP-1 expression vectors, and tested for their colony-forming ability. The 
number of colonies obtained from GAAP-1 or EGFP/GAAP-1 transfected U937 
cells was one tenth of that obtained from cells transfected with the EGFP control 
plasmid (figure 9). The level of fluorescence of EGFP/GAAP-1 transfected clones, 
determined by flow cytometry, was also much lower than that obtained with 
U937/EGFP cells (data not shown), suggesting that a strong over expression of 
GAAP-1 was lethal for these cells. In subsequent subculture steps to obtain cell 
lines, most of the cell clones transfected with GAAP-1 or EGFP/GAAP-1 triggered 
spontaneous apoptosis (fig 10). The clones in which the level of fluorescence was 
very low, seemed to proliferate in a manner similar to the parental cell line and 
were thus used for the following experiments. 

Western blot analysis of nuclear extracts showed that p53 and IRF-1 were 
induced 4-6 fold in the U937 EGFP/GAAP-1 cell line compared to U937 control 
cells (fig 11), demonstrating that GAAP-1 was implicated in the expression of 
endogenous p53 and IRF1. The increase in p53 and IRF-1 levels was correlated 
with a higher susceptibility of the cells to apoptosis. Treatment for 3 days with 
doses of the antimetabolic drug 5-fluorouracil (5FU), which were subapoptotic in 
U937 control cells, induced apoptosis in the U937 EGFP/GAAP-1 cell lines, as 
determined by TUNEL assay (fig 12). 
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In K562 cells the number of colonies obtained with the EGFP/GAAP-1 and 
GAAP-1 vectors, or the control vector EGFP, was very similar. The level of 
expression of the fusion protein EGFP/GAAP-1, determined by flow cytometry, 
was heterogeneous among the clones. By Western blot analysis p53 and IRF1 
were not detected in either parental or transfected cells . Cell lines expressing 
GAAP-1, EGFP/GAAP-1 or EGFP alone were established and their susceptibility 
to apoptosis was analysed. Treatment with 5FU or cytosine (3-D-Arabinofuranoside 
(Ara-C) did not cause a significative difference in apoptosis susceptibility between 
GAAP-1 or EGFP/GAAP-1 transfected cells and control cells. Thus, the failure to 
trigger apoptosis by over-expression of GAAP-1 correlated with the lack of p53 and 
IRF1 genes expression in K562 cell line. However, in all K562 cell lines expressing 
either GAAP-1 or EGFP/GAAP-1 we observe an approximately 2-5 fold increase in 
the amount of hemoglobin of about 2.5 fold (fig 13), which was comparable to the 
level of hemoglobin produced by the parental clone treated with Ara-C or hemln. In 
addition, the expression of GAAP-1 or GAAP-1 /EGFP strongly potentiated the 
differentiation inducing activity of Ara-C or hemin. These results indicates that 
enhanced expression of GAAP-1 induces erythroid differentiation and sensitises 
K562/GAAP-1 cells to differentiation induced by Ara-C or hemin. 

Discussion 

A cDNA encoding a DNA binding factor, which positively regulated the IRF-1 
and the p53 promoter activity through IPCSs, was cloned by the yeast one-hybrid 
system. Surprisingly, this cDNA, designated IPSC-BF1 cDNA, corresponded 
exactly to the 2.5 kb alternative transcript of the PRDI1-BF1 gene previously 
described by Gaynor and al (91). PRDII-BF1 was cloned independently by several 
groups (Fan 90, Baldwin 90, Muchardt 92) as a transcription factor implicated in 
the regulation of the IFNp, MHC I promoters, and the HIV-LTR. The binding site of 
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PRDII-BF1 on these regulatory sequences is an NF-kB recognition element. 
PRDII-BF1 is a zinc finger protein possessing two pairs of zinc fingers, each 
capable of binding individually an NF-kB site (Fan 90 Gaynor 91). We have shown 
that GAAP-1 , which possess only one pair of fingers, is able to bind specifically to 
at least two kinds of unrelated DNA sequences, the IPCS and the NF-kB site. In 
order to elucidate this double specificity of binding, we performed EMSA using in 
vitro translated GAAP-1 or mutated forms of this factor with either IPCS or NF-kB 
probes. The destruction of either of the two zinc fingers eliminates the binding to 
IPCS or NF-kB sequences, demonstrating that both zinc finger structures are 
required for both binding activities. This result suggests that amino acids others 
than those engaged in zinc finger formation (His and Cys residues) are implicated 
in the specific binding with the NF-kB or the IPCS motifs. A multiple specificity of 
DNA binding has already been reported for the WT1 proteins, whose multiple 
specificity relies on amino acid residues modified by alternative splicing between 
two zinc fingers (Hewitt 96). To our knowledge, GAAP-1 represents the first 
description of a zinc finger protein having fingers with a double specificity of 
binding. 

The in vitro translated GAAP-1 possesses an affinity for the IPCS probe 
lower than that for the NF-kB probe. However, in vivo produced GAAP-1 tagged 
with EGFP has the opposite property, suggesting that GAAP-1 is subjected to post 
translational modification which increases specifically its affinity for IPCS. Western 
blot analysis performed on the in vivo produced fusion proteins GAAP-1 /EGFP- 
BF1 and EGFP/GAAP-1 suggests that at least one mechanism may be implicated 
in the control of GAAP-1 binding affinity. Indeed, this experiment clearly 
demonstrates that a proteolytic cleavage occurs in vivo at approximately 100 
residues from the C terminus of GAAP-1 which could be determinant for the 
binding properties of the protein. Thus, our data may explain the results obtained 
by Gaynor et al. v who showed that the in vitro product of the differentially spliced 
2.5 kb mRNA, which corresponds to GAAP-1 , has a strong binding activity for a 
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NF-kB sequence. These observations suggest that the physiological role of GAAP- 
1 requires post-translational modifications. 

The activity of GAAP-1 is also regulated by its sub-cellular localisation. 
Although previous studies (Fan 90) have localised PRDII-BF1 exclusively in the 

nucleus # -we_have_demonstrated_that_GAAP=1_resides-in-both-the-nucleus-and 

cytoplasm. This difference may be ascribed to the alternative use of the two NLS 
present in the PRDII-BF1 gene. The first NLS was identified by Fan in PRDII-BF1 
and is not encompassed by GAAP-1 . This NLS may be responsible for the strictly 
nuclear localisation of PRDII-BF1. We have identified a second NLS, without which 
GAAP-1 cannot be imported in the nucleus. This observation underlines the fact 
that GAAP-1 is a transcription factor possessing its own NLS and functionally 
independent of PRDII-BF1. While the whole GAAP-1 is localised both in the 
nucleus and cytoplasm, the protein deleted of the PEST domain is strictly nuclear. 
The PEST sequence is recognised by ubiquitin-conjugating enzymes in the 
pathway which leads proteins to be degraded in the proteasome (Rechsteiner 96). 
Furthermore, the PEST domain may also be required for the export of transcription 
factors from the nucleus to the cytoplasm in order to be degraded in the 
proteasome (ref Jab1). Thus, the deletion of the PEST domain may retain GAAP-1 
into the nucleus, by abolishing the export of the protein. 

Although PRDII-BF1 has been identified on the basis of its interaction with 
the IFNp promoter (Fan 90) and the enhancers of MHC I and kappa 
Immunoglobulin genes (Baldwin 90), its transcriptional activity was demonstrated 
only in the case of the HIV enhancer (Tsujimoto 91,). In addition, it was 
demonstrated that the 90 kD and the 70 kD proteins, the last one corresponding to 
GAAP-1, which are coded by two differentially spliced messages of the PRDII-BF1 
gene, are devoid of any transcriptional activity on the HIV LTR (Gaynor). This 
observation is in agreement with our data and suggests that GAAP-1, contrary to 
PRDII-BF1, is not engaged in physiological transcription involving NF-kB binding 
sites. Our results, indeed, unambiguously demonstrate a positive transcriptional 
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activity of GAAP-1 on the p53 and IRF1 promoters, which is strictly correlated to 
the presence of an IPCS binding site on these promoters. Furthermore, this 
transcriptional activity is confirmed by the enhancement of the p53 and IRF1 
endogenous genes products shown in the U937 cells transfected with a GAAP-1 
expression vector. Thus, all these data permit us to define GAAP-1 as a 
transcription factor implicated in the positive transcription of at least the two anti- 
oncogenes IRF1 and p53. This property suggests that GAAP-1 may also act as an 
anti-oncogene. 

U937 cells over expressing GAAP-1 undergo apoptosis with a susceptibility 
which correlates with the levels of p53 and IRF1. This result suggests that the 
possible cause of cellular death is the increase of p53 and IRF1. It has to be 
pointed out that in U937 cells the p53 gene contains point mutations (ref). 
However, a transitory increase in the expression of the endogenous p53 was 
implicated (Gabriele 99) or is even required (Yeung 98) for the apoptosis of these 
cells, suggesting that p53, in spite of its mutations, is still active. Moreover, the 
increase in IRF1 expression may also contribute to apoptotic death, as has been 
already reported (Nguyen 97). On the other hand, U937 cell lines which express 
levels of GAAP-1 compatible with cell proliferation Exhibit increased sensitivity to 
genotoxic drugs. Thus, GAAP-1 may play an important role in the surveillance of 
the genomic integrity controlling the level of the key component p53. 

K562 cells, which do not express either IRF1 or p53, do not trigger 
apoptosis following GAAP-1 over-expression, but undergo erythroid differentiation. 
This result suggests that GAAP-1 also regulates the expression of unknown genes 
implicated in other cellular functions. 

In conclusion, our data suggest that the targets of the transcription factor 
GAAP-1 are multiple and are involved in processes concerning cell growth control, 
apoptosis and differentiation. 
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CLAIMS 

1. Use of a nucleotide sequence selected among: 

(i) a nucleotide sequence comprising the DNA sequence identified 
under No. 1, designated GAAP-1, 

(ii) a nucleotide sequence encoding a polypeptide having the amino-acid 
sequence identified under No. 2 or, 

(iii) a nucleotide sequence derived from sequence defined under (i) or (ii) 
wherein said sequence is modified providing that the resulting 
nucleotide sequence encodes a polypeptide capable of binding a 
nucleotide sequence designated IPCS which comprises the DNA 
sequence AAATGNNNNC, 

for the expression in a determined eucaryotic cell, of a polypeptide capable of 
interacting with said nucleotide sequence designated IPCS and capable of acting 
as a positive transcription factor for the transcription of a nucleotide sequence 
placed under the control of said IPCS sequence and present in said eucaryotic 
cell. 

2. Use of a nucleotide sequence according to claim 1 for the expression of a 
polypeptide acting as a positive transcription factor for a gene chosen among 
genes involved in the control of cellular growth, cellular proliferation, cellular 
differenciation or cellular apoptosis. 

3. Use according to claim 1 or 2 wherein the nucleotide sequence 
comprises the cDNA corresponding to the 2.5kb coding sequence of the transcript 
of the PRDII-BF1 gene. 

4. Use according to claim 3 wherein a start codon is added upstream from 
exon III of the PRDII-BF1 gene. 
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5. Use according to claim 3 wherein the nucleotide sequence comprises the 
succession of exons III, V, VI, VII, VIII and IX of the PRDII-BF1 gene, said 
nucleotide sequence being devoid of exon IV of said PRDII-BF1 gene. 



6. Use according to anyone of claims 1 to 3 in which the nucleotide 
sequence codes for a GAAP-1 polypeptide comprising the amino acid sequence 
identified under No. 2. 

7. Use according to anyone of claims 1 to 5, wherein the nucleotide 
sequence codes for a variant of GAAP-1 polypeptide, said variant being derived 
from GAAP-1 by insertion, deletion or substitution of one or several amino acid 
residues, provided it retains the property of GAAP-1 to bind an IPCS sequence 
and to act as a transcriptional factor for the for the transcription of a nucleotide 
sequence placed under the control of said IPCS sequence and present in a 
eucaryotic cell. 

8. Use according to anyone of claims 1 to 6 in which the nucleotide 
sequence hybridises under stringent conditions with the DNA sequence identified 
under No. 1 . 

9. Use of a nucleotide sequence according to anyone of claims 1 to 8, 
wherein said nucleotide sequence is placed under the control of a promoter 
sequence selected among constitutive or inducible promoters. 

10. Use of a nucleotide sequence according to anyone of claims 1 to 9 
wherein the eucaryotic cells are malignant cells. 
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11. Use according to anyone of claims 1 to 10 wherein the eucaryotic cells 
are those of a developed tumor. 

12. Use of a nucleotide sequence according to anyone of claims 1 to 9 for 
the control of cell apoptosis. 

13. Use of a nucleotide sequence according to anyone of claims 1 to 12 
wherein the positive regulation of transcriptional activity is obtained for a gene 
selected among p53, IRF1, Rb, p21 (WAF1) f p27, wt1, bax, TNF receptor and 
FAS. 

14. Use of a nucleotide sequence according to anyone of claims 1 to 13 
wherein the positive regulation of transcriptional activity allows regulation of 
transcription of several genes in the treated cells. 

15. Nucleotide sequence comprising the DNA sequence identified under 
No. 1, said nucleotide sequence being devoid of the sequence forming exon IV in 
the PRDII-BF1 gene. 

16. Nucleotide sequence according to claim 15 which consists of the DNA 
sequence identified under No. 1. 

17. Nucleotide sequence derived from the nucleotide sequence according to 
anyone of claims 15 or 16 and which is selected from the group consisting of: 

(i) a fragment of the DNA sequence identified under No. 1 , which can 
be used as a specific probe to detect the presence of said DNA 
sequence identified under No. 1 or a mutated sequence thereof, 

(ii) a nucleotide sequence encoding a polypeptide having the amino-acid 
sequence identified under No. 2, 
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(iii) a nucleotide sequence derivedfrom sequence defined under (i) or (ii) 

wherein said sequence is modified providing that the resulting 
nucleotide sequence encodes a polypeptide capable of binding a 
nucleotide sequence designated IPCS and comprising the DNA 

sequence AAATGNNNNC , said nucleotide seq uence being capable, 

when used in appropriate conditions, of expressing in a determined 
eucaryotic cell of a polypeptide capable of interacting with the 
nucleotide sequence designated IPCS and capable of acting as a 
positive transcriptional factor for the transcription of a. nucleotide 
sequence placed under the control of said IPCS sequence and 
present in said eucaryotic cell. 

18. Nucleotide sequence which is contained on plasmid pGAAPI deposited 
at the ECACC under n° 01052921 on May 29, 2001. 

19. Recombinant polypeptide, being the product of the expression in a 
eucaryotic cell, of a nucleotide sequence coding for a polypeptide capable of 
interacting with the nucleotide sequence designated IPCS and capable of acting 
as a positive transcription factor for the transcription of a nucleotide sequence 
placed under the control of said IPCS sequence and present in said eucaryotic 
cell. 

20. Recombinant polypeptide, being the product of the expression in a 
eucaryotic cell, of a nucleotide sequence according to anyone of claims 15 to 18. 

21. Recombinant polypeptide according to anyone of claims 19 to 20 which 
regulates the transcriptional activity of a gene selected among p53 and IRF1 when 
it is expressed in a eucaryotic cell constitutively expressing said gene. 
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22. Recombinant polypeptide according to anyone of claims 19 to 21 which 
has an apparent molecular weight of 75 kDa by SDS PAGE electrophoresis. 

23. Recombinant polypeptide, being the expression product in a eucaryotic 
cell selected among mammal cells, of a nucleotide sequence according to 
anyone of claims 15 to 18, said recombinant polypeptide including post 
translational modification enabled in said eucaryotic cell. 

24. Recombinant polypeptide according to anyone of claims 19 to 23 which 
comprises the amino acid sequence identified under No. 2. 

25. Recombinant polypeptide according to anyone of claims 19 to 23 which 
comprises a fragment of the amino acid sequence identified under No. 2 
provided the polypeptide is capable of interacting with the nucleotide sequence 
designated IPCS to act as a positive transcription factor for the transcription of 
a nucleotide sequence placed under the control of said IPCS sequence and 
present in said eucaryotic cell. 

26. Recombinant eucaryotic cell which is recombined by insertion of a 
nucleotide sequence according to anyone of claims 1 5 to 1 8. 

27. Recombinant eucaryotic cell according to claim 26 which expresses the 
polypeptide encoded by the inserted nucleotide sequence. 

28. Recombinant eucaryotic cell according to claim 26 or 27 which is a cell 
selected among cells normally expressing genes involved in the control of cell 
growth, cell differenciation, cell proliferation or cell apoptosis. 
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- 29. Recombinant eucaryotic cell according to anyone of claims 26 to 28 
which is a malignant cell. 

30. Recombinant eucaryotic cell which is recombined with a nucleotide 
s equence encoding a polype ptide capable of bindin g a nuc leotide sequence 
designated IPCS and comprising the DNA sequence AAATGNNNNC, to 
enable, in appropriate conditions, the expression of a polypeptide capable of 
interacting with the nucleotide sequence designated IPCS and capable of 
acting as a positive transcriptional factor for the transcription of a nucleotide 
sequence placed under the control of said IPCS sequence and present in said 
eucaryotic cell. 

31. Recombinant eucaryotic cell according to anyone of claims 26 to 30 
wherein the polypeptide comprises the aminoacrd sequence identified under 
No. 2. 

32. Recombinant eucaryotic cell according to anyone of claims 26 to 31 
which is selected among U937, K562, SK-N-SH, MCF7, KG1, TF1. 

33. Recombinant vector which is recombined with an insert consisting of a 
nucleotide sequence according to anyone of claims 1 5 to 1 8. 

34. Recombinant vector according to claim 33 which is an expression vector 
suitable for expression of said insert, in a eucaryotic cell. 



35. Recombinant vector according to claim 34 which is suitable for transient 
or controlled expression of said insert. 
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36. Recombinant vector according to anyone of claims 34 or 35 wherein 
said insert is placed under the control of a promoter regulated by a 
physiologically acceptable compound. 

37. Recombinant vector according to anyone of claims 34 or 35 wherein the 
transcription of said insert is placed under the control of an exogenous 
transactivating system. 

38. Recombinant vector according to anyone of claims 34 to 37 which is 
suitable for gene therapy. 

39. Recombinant vector according to claim 38 which is selected among 
viral, retroviral, lentlviral, poxviral, adenoviral, AAV vectors. 

40. Composition suitable for a therapeutic use which comprises a nucleotide 
sequence according to anyone of claims 15 to 18 or a recombinant cell 
according to anyone of claims 26 to 32, or a recombinant polypeptide 
according to anyone of claims 19 to 25. 

41 . Composition according to claim 40 which is used in combination with a 
therapeutic agent selected among antiviral agents and anticancer agents 
wherein said combined use is permitted by the time - or site - associated or 
separated use of the composition and of the therapeutic agent. 

42. Composition according to claim 41 wherein the anticancer agent is an 
immuotherapeutic agent, a chemotherapeutic agent or is radiotherapy. 

43. Composition according to anyone of claims 40 to 42 for the treatment of 
a malignant cell. 
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44. Use of a nucleotide sequence according to anyone of claims 15 to 18 or 
of a recombinant cell according to anyone of claims 26 to 32, or a recombinant 
polypeptide according to anyone of claims 19 to 25, for the preparation of a 
biological tool for screening compounds capable of interacting with an iphs 
sequence present in the promoter sequence of a gene involved in the control of 
cellular growth, cellular proliferation, cellular differenciation or cellular 
apoptosis, and capable of regulating the transcription of a nucleotide sequence 
placed under the control of said IPCS sequence. 

45. Process for the in vitro detection of a deficient BRDII-BFI gene 
comprising the steps of: 

- contacting a probe constituted from the nucleotide sequence identified under 
No. 1 or a fragment thereof comprising the zinc finger binding domains 
corresponding to the domains localised within exon VI of the BRDII-gene, with 
the DNA of a cell normally constitutively expressing said gene, in stringent 
conditions, 

- detecting the hybridisation product between said probe and said cell DNA. 

46. Process for the in vitro detection of a deficient transcriptional activity of 
genes involved in the control of cell growth, cell differenciation, cell proliferation or 
ceil apoptosis, comprising the step of detecting a deficient production of the 
transcript of said gene which would normally encode a polypeptide capable of 
binding an IPCS sequence and as a result would positively regulate the 
transcription of a nucleotide sequence placed under the control of said IPCS 
sequence. 

47. Process for the in vitro detection of a prognostic for transformation of 
cells toward a malignant state, which comprises the step of detecting a mutation in 
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the PRDII-BFI gene normally expressed in said cells, or detecting a mutation in the 
transcript obtained by splicing of said gene, which mutation would result in lack of 
expression or in an abnormal expression of polypeptide expression product of said 
PRDII-BF1 gene capable of binding to an IPCS sequence. 

48. Process for the screening of compounds capable of regulating the 
transcriptional activity of genes containing an IPCS sequence in their promoter 
region, said genes being active in the control of cell growth, cell differentiation, cell 
proliferation, or cell apoptosis, said process comprising the steps of : 

- contacting the assayed compounds with the DNA of a cell expressing genes 
containing an IPCS sequence in their promoter region, 

- detecting a DNA-compound complex formation and assaying its transcriptional 
activity on said gene containing the IPCS sequence. 

49. Use of a recombinant polypeptide according to anyone of claims 19 to 
25 for screening molecules suitable for use as agonist of GAAP-1 . 

50. Use of a recombinant polypeptide according to anyone of claims 19 to 
25 for screening molecules suitable for use as antagonist of GAAP-1 . 



! 
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ABSTRACT 



Constitutive expression of the tumor suppressor genes p53 and IRF1 
(interferon regulatory factor 1) is required for the maintenance of the cellular 
growth control during genotoxic damage or aberrant proliferation. A transcriptional 
regulatory element within the promoter region of both the p53 and IRF-1 genes 
which binds a positive transcription factor, denoted GAAP-1 (IRF1 p53 common 
sequence binding factor). The invention relates to the identification and cloning of 
the GAAP-1 cDNA and to the identification of the encoded protein corresponding 
to the 75 kDa product of the alternatively sliced PRDII-BF1 messenger RNA 
precursor. 
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Figure 1B 
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Figures 
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1PCS-P53 probe MHCNFkB probe 




12-3 45678 12 3 456 7 8 



C 

3AAP-1 wt . . CEECGI RCKKPSMLKKHIRT HTDVRPYH CTYGNFSFKTKGNLTKHMKSKAH.. 

3AAP-1 mtZ1 ... 3E ORCKKPSMLKKHIRT HTDVRPYH CTYCNFSFKTKGNLTKHMKSKAH. 

GAAP-1 mt?2 ...CEECGIRCKKPSMLKKH1RT HTDVRPYH CTYCNFSFKTKGNLK MK*A 
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Figure 8 
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Figure 10 
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Figure 12 
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1 CTAP-1 COSES SEQUENCE ^\^U>UL- -4 ^ 

V IT 

C^ftPl cQMOraaGRM^^ 290 
GftAEL CTTOCTQm^ 350 

renn criacrafEM^^ 350 



Cftftpi Gftnaa ggaE Qs a gr^^ 420 

P RI >t t t3V*ft3CTCTCEft^^ 430 

gbapi aaaaroaaG^^ 560 

WOU AftaMCTMIGftC^^ 560 

GAAP1 CXCGC3VGQOCTroCKXGX^ 630 

EBDD C00GC3O3CCi^\juaTC 630 

GMTS C^^ 700 

flRDPE GS CTKXOTTTO^ ^ 700 
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seed: gsaraaQaaaaacagOT iTOi^ ^ 

GWBl COXZO^ICmTtmri«G0^ ^ 1120 

a 0 * 8 ^^'*^^ 1330 

<3aBB1 rc* 300 ^^ 1400 

B ^ TOt3BOC ^^ 1400 

CMX3 ^ u « oaaK ^^ 1470 

58011 CRCftcKn ' a ^ iao 

I 
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CTftEl AtTBCfECT&aaCSV^^ 1750 

HflXTT ACnCTOEAQCX^^^ 1750 

PHunx Aftwaocftaan^^ 1320 

GBftPl AA3EGK2&C«^ 1890 

SSPET J»KK3&C»C20fc^^ 1890 

Head a^aocacxxA\j^^ igso 

GWRL aj^flClCTClC^^ 2030 

HOEC ASftCICTgiCTX^^ 2030 

GftftKL C2UX)Gfia3IOftQCTGOG5^^ 2084 

H0Jt£i eagoBaagreagcaQOGK^^ 2034 



2 <3*ftP-l 



I 



CS^Pl IXraGVHlDECCfVI*^^ 280 
EKBH rX^^SVHTOHi^L^^ 280 



19/20 



GR&KL 

OTH YESSBEOXSStSSGSaa 



<SWEL 

m&n i, 



BSDSZ 
GAKEO. 



350 



WSVW3REDG1HRII3V 420 
420 



490 
490 



560 
560 



630 
630 



J3GL2 



JD3L 



SEDCEEBLVIRa? 



694 



20/20 



Figure 16 
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